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^ (54) Title: HEPATITIS B CORE ANTIGEN FUSION PROTEINS 

00 (57) Abstract: The hepatitis B virus (HBV) capsid is made up of a single species of protein called the core antigen (HBcAg) which 
l/^ self-assembles into particles. The particles are highly immunogenic and are able to present heterologous epitopes to the immune 

system when the epitopes arc inserted into a surface-exposed region of the particles called the "el loop". The structural building 
^ blocks of the particles are lightly associated dimers of HBcAg in which the adjacent el loops are closely juxtaposed. It is proposed 

that sequences inserted into the el loop are conformationally restrained in the assembled particles when presented in monomelic 

core protein. The invention seeks to solve this problem by covalently linking core proteins as tandem copies, e.g., as dimers, so 
* that insertions can be made independently in each copy. This is particularly useful for insertion of large sequences into the el loop 
Q because it allows such sequences to be inserted into just one copy of the core protein per tandem repeat, thereby reducing potential 

conformational clashes in assembly. Alternatively, a different sequence may be inserted into each el loop of a tandem repeat, thus 
^ increasing the flexibility of HBcAg particles as an epitope delivery system. 
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HEPATITIS B CORE ANTIGEN FUSION PROTEINS 

The invention relates to hepatitis B core antigen fusion proteins, particles containing the 
proteins, nucleic acid molecules encoding the proteins, processes for producing the 
5 proteins, pharmaceutical compositions containing the proteins and use of the proteins in 
prophylactic and therapeutic vaccination. 

Background to the invention 

10 Hepatitis B is a major healthcare problem throughout both the developed and developing 
world. Infection with the hepatitis B virus (HB V) can result in an acute or chronic disease 
which in a proportion of cases may lead to hepatocellular carcinoma and death. The virus 
is double shelled, and its DNA is protected inside a protein structure called the core 
antigen (HBcAg). The core is surrounded by the envelope protein known as the surface or 

1 5 S antigen (HBsAg). HBcAg is an unusual antigen which can be used as a delivery vehicle 
for specific peptides to the immune system. The antigen has been used to present 
T-helper, B and cytotoxic lymphocyte (CTL) epitopes from a variety of viral and bacterial 
pathogens, including epitopes from the surface antigen of HBV, envelope proteins from 
hepatitis A virus and antigens from hepatitis C virus. For a review see Ulrich et al (1998) 

20 Advances in Virus Research 50 141-182. 

HBcAg is an excellent vehicle for the presentation of epitopes due to the molecular 
structure of the protein, which self-assembles into particles. Each particle is generated 
from either 180 or 240 copies of a monomelic polypeptide. The monomer, on reaching an 
25 appropriate concentration inside the host cell, forms a particle of approximately 27 nm in 
diameter. Structural studies have shown that amino acids within the region from residues 
68 to 90 form a spiked structure on the surface of the particle which is known as the el 
loop. Two monomers joined by disulphide bonds link to form a dimer spike, the most 
exposed amino acid being at position 80 (at the centre of the el loop). 

30 
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EP-A-421635 (The Wellcome Foundation Limited) describes modification of the HBV 
core gene to allow insertion of foreign epitopes into the el loop without altering the 
potential of the protein to from particles. Insertion at this site allows maximum exposure 
of the inserted epitope on the tip of each spike created by dimers of the protein. As there 
5 are approximately 180 or 240 copies of each monomer per particle, each particle is able to 
present 180 or 240 copies of the epitope of interest. 

Summary of the invention 

10 In the dimers of HBcAg which form the structural building blocks of core particles, 
adjacent el loops are closely juxtaposed. It is proposed that sequences inserted into the el 
loop are conformationally restrained in the assembled particles when presented in 
monomelic core protein. The invention seeks to solve this problem by covalently linking 
core proteins as tandem copies, e.g. as dimers, so that insertions can be made 

15 independently in each copy. This is particularly useful for insertion of large sequences 
into the el loop because it allows such sequences to be inserted into just one copy of the 
core protein per tandem repeat, thereby reducing potential conformational clashes in 
assembly. Alternatively, a different sequence may be inserted into each el loop of a. 
tandem repeat, thus increasing the flexibility of HBcAg particles as an epitope 

20 presentation system. 

Thus, the invention provides a protein comprising tandem copies of HBcAg. The protein 
is generally a dimer comprising two copies of HBcAg. A heterologous epitope may be 
inserted into the el loop of one or more of the copies of HBcAg. The protein assembles 
25 into particles which present the heterologous epitope inserted in the el loop on their 
surfaces and are useful in the prophylactic and Uierapeutic vaccination of humans and 
animals. 



30 



WO 01/77158 



PCT/GB01/01607 



Detailed description of the invention 
The protein 

5 

The basic building block of the protein of the invention is HBcAg, which has 183 or 185 
amino acids (aa) depending on the subtype of HBV. The sequence of the 183 amino acid 
protein of the ayw subtype plus a 29 amino acid pre-sequence is shown in SEQ ID No. 2. 
The mature HBcAg runs from the Met residue at position 30 to the Cys residue at the 
10 extreme C-tenninus, with the sequence from positions 1 to 29 being a pre-sequence. 

The protein generally comprises only two copies of HBcAg forming a dimer because 
dimers of HBcAg form the structural building blocks of core particles. However, the 
protein may comprise further copies of HBcAg. Thus, the protein may comprise from 2 to 
15 8 copies or from 2 to 4 copies of HBcAg, The use of more than two copies increases the 
flexibility of the system; for example, the use of three copies allows three different 
epitopes to be inserted into three el loops in the protein of the invention and thereby 
increases the breadth of the immune response induced by the protein of the invention. 

20 The HBcAg units are generally joined together in a head-to-toe fashion, i.e. the C- 
terminus of one unit is joined to the N-terminus of the adjacent unit. The units may be 
joined directly by a covalent bond (e.g. a peptide bond), but preferably they are joined by a 
linker which spaces the adjacent units apart and thereby prevents any problem with 
disruption of the packing of adjacent units. The nature of the linker is discussed below. 

25 

One or more of the HBcAg units in the protein of the invention may be native full length 
HBcAg. However, generally at least one of the units is a modified form of HBcAg, for 
example HBcAg modified by insertion of a heterologous epitope in the el loop. In dimers 
according to the invention, one of the HBcAg units may be modified and the other may be 
30 native HBcAg. 
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As a general rule, any modifications are chosen so as not to interfere with the 
conformation of HBcAg and its ability to assemble into particles. Such modifications are 
made at sites in the protein which are not important for maintenance of its conformation, 
for example in the el loop, the C-terminus and/or the N-terminus. The el loop of HBcAg 
5 can tolerate insertions of e.g. from 1 to 120 amino acids without destroying the particle- 
forming ability of the protein. 

.The HBcAg sequence may be modified by a substitution, insertion, deletion or extension. 
The size of insertion, deletion or extension may, for example, be from 1 to 200 aa, from 3 

10 to 100 aa or from 6 to 50 aa. Substitutions may involve a number of amino acids up to, for 
example, 1, 2, 5, 10, 20 or 50 amino acids over the length of the HBcAg sequence. An 
extension may be at the N- or C-terminus of HBcAg. A deletion may be at the N- 
terminus, C-terminus or at an internal site of the protein. Substitutions may be made at 
any position in the protein sequence. Insertions may also be made at any point in the 

1 5 protein sequence, but are typically made in surface-exposed regions of the protein such as 
the el loop. An inserted sequence may carry a heterologous epitope. More than one 
modification may be made to each HBcAg unit. Thus, it is possible to make a terminal 
extension or deletion and also an internal insertion. For example, a truncation may be 
made it the C-terminus and an insertion may be made in the el loop. 

20 

Substitutions will generally be conservative and may be made, for example, according to 
the following Table, in which amino acids in the same block in the second column and 
preferably in the same line in the third column may be substituted for each other. 



ALIPHATIC 


Non-polar 


GAP 
ILV 




Polar-uncharged 


CSTM 
NQ 




Polar-charged 


DE 
KR 


AROMATIC 


HFWY 
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Each part of the HBcAg sequence in the protein of the invention preferably has at least 
70% sequence identity to the corresponding sequence of a natural HBcAg protein, such as 
the protein having the sequence shown in SEQ ID NO: 2. More preferably, the identity is 
5 at least 80%, at least 90%, at least 98%, at least 97% or at least 99%. Methods of 
measuring protein sequence (and nucleic acid Sequence) identity are well known in the art. 
For example, the UWGCG Package provides the BESTFIT programme (Devereux et al 
(1984) Nucleic Acids Research 12, p.387-395). Similarly, the PILEUP and BLAST 
algorithms can be used to line up sequences (for example as described in Altschul S. F. 
10 (1993) J: Mol Evol 36:290-300 and Altschul, S. F. etal {1990)1 Mol Biol 215:403-10). 

The el loop of HBcAg is at positions 68 to 90, and a heterologous epitope may be inserted 
anywhere between these positions. Preferably, the epitopeds inserted in the region from 
positions 69 to 90, 71 to 90 or 75 to 85. Most preferred is to insert the epitope between 

15 amino acid residues 79 and 80 or between residues 80 and 81 . When a heterologous 
epitope is inserted, the entire sequence of HBcAg may be maintained, or alternatively the 
whole or a part of the el loop sequence may be deleted and replaced by the heterologous 
sequence. Thus, amino acid residues 69 to 90, 71 to 90 or 75 to 85 may be replaced by a 
heterologous epitope. Where a heterologous epitope replaces el loop sequence, the 

20 epitope is generally not shorter than the sequence that it replaces. 

A C-terminal truncation of HBcAg will generally not go beyond aa 144 because if any 
further truncation is made particles may not form. Thus, the deleted amino acids may, for 
example, comprise aa 144 to the C-terminal aa (aa 183 or 185), aa 150 to the C-terminal 
25 aa, aa 164 to the C-terminal aa or aa 172 to the C-terminal aa. The C-terminus of HBcAg 
binds DNA, and truncation of the C-terminus therefore reduces or completely removes 
DNA from preparations of HBcAg and HBcAg hybrid proteins. 

The protein of the invention forms particles which preferably resemble the particles 
30 formed by native HBcAg. The particles of the invention are typically at least 10 run in 
diameter, for example from 10 to 50 nm or from 20 to 40 nm in diameter, but preferably 
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they are about 27 cm in diameter (which is the size of native HBcAg particles). They 
comprise multiple HBcAg units, for example from 150 to 300 units, but generally they are 
fixed to about 1 80 or about 240 units (which are the numbers of units in native HBcAg 
particles). Where the protein of the invention is a dimer, this means that the number of 
5 protein monomers in the particles may be from 75 to 150 but is generally about 90 or 
about 120. 

The linker between adjacent HBcAg units is generally a chain of amino acids at least 
1.5 nm (15 A) in length, for example from 1.5 to 10 nm, from 1.5 to 5 nm or from 1.5 to 3 

10 nm. It may, for example, comprise 4 to 40 aa or 10 to 30 aa, preferably 15 to 21 aa. The 
linker is generally flexible. The amino acids in the linker may, for example, include or be 
entirely composed of glycine, serine and/or proline. A preferred linker comprises one or 
more repeats of the sequence GlyGlySer (GGS). Alternatively, the linker may comprise 
one or more GlyPro (GP) dipeptide repeats. The number of repeats may, for example, be 

1 5 from 1 to 1 8, preferably from 3 to 12. In the case of GGS repeats, the use of 5, 6 or 7 
repeats has been found to allow the formation of particles. The linker may correspond to 
the hinge region of an antibody; this hinge region is thought to provide a flexible joint 
between the antigen-binding and tail domains of antibodies. 

20 As indicated above, a heterologous epitope may be inserted into one or more of the copies 
of HBcAg in the protein of the invention, preferably into the el loop. A "heterologous" 
epitope is an epitope that is not normally located at the position at which it is located in the 
HBcAg; it is generally from a protein other than HBcAg but it may be from a different 
location in HBcAg. The epitope comprises a sequence of amino acids which raises an 

25 immune response. The epitope may be conformational or linear. It may be, for example, 
in a sequence of from 6 to 120 aa, from 6 to 50 aa or from 6 to 20 aa. A major advantage 
of the invention is that it allows epitopes carried on large sequences to be inserted into the 
el loop, for example on sequences of from 30 to 120 aa, 40 to 120 aa or 60 to 120 aa. 

30 The protein of the invention may contain more than one heterologous epitope, for example 
up to 2, 3, 5 or 8 heterologous epitopes, and in this case the epitopes may be present in the 
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same or different HBcAg units. More than one copy of an epitope may be inserted in each 
HBcAg unit; for example, from 2 to 8 copies may be inserted. Where there are two or 
more heterologous epitopes in the protein of the invention, they may be from the same or 
different organisms and from the same or different proteins. ■ 

5 

The epitope may be a T-cell or a B-cell epitope. If it is a T-cell epitope, it may be a 
cytotoxic T-lymphocyte (CTL) epitope or a T-helper (Th) cell epitope (e.g. a Thl or Th2 
epitope). In a preferred embodiment of the invention, one of the epitopes is a T-helper cell 
epitope and another is a B-cell or a CTL epitope. The presence of the T-helper cell 
10 epitope enhances the immune response against the B-cell or CTL epitope. 

The choice of epitope depends on the disease that it is wished to vaccinate against. The 
epitope may, for example, be from a pathogenic organism, a cancer-associated antigen or 
an allergen. The pathogenic organism may, for example, be a virus, a bacterium or a 
15 protozoan. 

Examples of pathogens whose epitopes may be inserted include hepatitis A virus (HAV), 
HBV, HCV, influenza virus, foot-and-mouth disease virus, poliovirus, herpes simplex 
virus, rabies virus, feline leukemia virus, human immunodeficiency virus type 1 (HIV1), 

20 human immunodeficiency virus type 2 (HTV2), simian immunodeficiency virus (SIV), 
human rhinovirus, dengue virus, yellow fever virus, human papilloma virus, respiratory 
syncytial virus, Plasmodium falciparum (a cause of malaria), and bacteria such as 
Mycobacteria, Bordetella, Salmonella, Escherichia, Vibrio, Haemophilus, Neisseria, 
Yersinia and Brucella. Specifically, the bacterium may be Mycobacterium tuberculosis - 

25 the cause of tuberculosis; Bordetella pertussis or Bordetella parapertussis - causes of 
whooping cough; Salmonella typhimwium - the cause of salmonellosis in several animal 
species; Salmonella typhi - the cause of human typhoid; Salmonella enteritidis - a cause of 
food poisoning in humans; Salmonella choleraesuis - a cause of salmonellosis in pigs; 
Salmonella dublin - a cause of both a systemic and diarrhoeal disease in cattle, especially 

30 in new-born calves; Escherichia coli - a cause of food poisoning in humans; Haemophilus 
influenzae - a cause of meningitis; Neisseria gonorrhoeae - a cause of gonnoirhoeae; 
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Yersinia enterocolitica - the cause of a spectrum of diseases in humans ranging from 
gastroenteritis to fatal septicemic disease; and Brucella abortus - a cause of abortion and 
infertility in cattle and a condition known as undulant fever in humans. 

5 Examples of candidate epitopes for use in the invention include epitopes from the 
following antigens: the HTV antigens gp 120, gp 160, gag, pol, Nef, Tat and Ref; the 
malaria antigens CS protein and Sporozoite surface protein 2; the influenza antigens HA, 
NP and NA; the herpes virus antigens EBV gp340, EBV gp85, HSV gB, HSV gD, HSV 
gH, HSV early protein product, cytomegalovirus gB, cytomegalovirus gH, and IE protein 

10 gP72; the human papilloma virus antigens E4, E6 and E7; the respiratory syncytial virus 
antigens F protein, G protein, and N protein; the pertactin antigen of B.pertussis; the 
tumor antigens carcinoma CEA, carcinoma associated mucin, carcinoma P53, melanoma 
MPG, melanoma P97, MAGE antigen, carcinoma Neu oncogene product, prostate specific 
antigen (PSA), prostate associated antigen, ras protein, and myc; and house dust mite 

15 allergen. 

Especially preferred epitopes are those from the pre-Sl region, the pre-S2 region, the S 
. region or core antigen of HBV. It is possible to insert the whole of the pre-S 1 and/or the 
whole of the pre-S2 region into HBcAg, but generally only a part of one of the regions is 

20 inserted. The inserted part is typically at least 6 amino acids in length, for example from 6 
to 120 aa, 20 to 80 aa or 20 to 50 aa. The insert may include, for example, the residues at 
pre-Sl positions 1-9, 10-19, 20-29, 30-39, 40-49, 50-59, 60-69, 70-79, 80-89, 90-99, 100- 
109 or 110-119 or the residues at pre-S2 positions 120-129, 130-139, 140-149, 150-159, 
160-169 or 170-174. Particularly preferred inserts are pre-Sl residues 20-47 and pre-S2 

25 residues 139-174. 

Making the proteins of the invention 

The proteins of the invention are generally made by recombinant DNA technology. The 
30 invention includes a nucleic acid molecule (e.g. DNA or RNA) encoding a protein of the 
invention, such as an expression vector. The nucleic acid molecules may be made using 
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known techniques for manipulating nucleic acids. Typically, two separate DNA 
constructs encoding two HBcAg units are made and then joined together by overlapping 
PCR. 

5 A protein of the invention may be produced by culturing a host cell containing a nucleic 
molecule encoding the protein under conditions in which the protein is expressed, and 
recovering the protein. Suitable host cells include bacteria such as E. coli, yeast, 
mammalian cells and other eukaryotic cells, for example insect Sf9 cells. 

10 The vectors constituting nucleic acid molecules according to the invention may be, for 
example, plasmid or virus vectors. They may contain an origin of replication, a promoter 
for the expression of the sequence encoding the protein, a regulator of the promoter such 
as an enhancer, a transcription stop signal, a translation start signal and/or a translation 
stop signal. The vectors may also contain one or more selectable marker genes, for 

1 5 example an ampicillin resistance gene in the case of a bacterial plasmid or a neomycin 
resistance gene in the case of a mammalian vector. Vectors may be used in vitro, for 
example for the production of RNA or used to transform or transfect a host cell. The 
vector may also be adapted to be used in vivo, for example in a method of gene therapy or 
DNA vaccination. 

20 

Promoters, enhancers and other expression regulation signals may be selected to be 
compatible with the host cell for which the expression vector is designed. For example, 
prokaryotic promoters may be used, in particular those suitable for use in E. coli strains 
(such as E. coli HB 1 01). A promoter whose activity is induced in response to a change in 

25 the surrounding environment, such as anaerobic conditions, may be used. Preferably an 
htrA or nirB promoter may be used. These promoters may be used in particular to express 
the protein in an attenuated bacterium, for example for use as a vaccine. When expression 
of the protein of the invention is carried out in mammalian cells, either in vitro or in vivo, 
mammalian promoters may be used. Tissue-specific promoters, for example hepatocyte 

30 cell-specific promoters, may also be used. Viral promoters may also be used, for example 
the Moloney murine leukaemia virus long terminal repeat (MMLV LTR), the rous 
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sarcoma virus (RS V) LTR promoter, the S V40 promoter, the human cytomegalovirus 
(CMV) IE promoter, herpes simplex virus promoters and adenovirus promoters. All these 
promoters are readily available in the art. 

5 A protein according to the invention may be purified using conventional techniques for 
purifying proteins. The protein may, for example, be provided in purified, pure or isolated 
form. For use in a vaccine, the protein must generally be provided at a high level of 
purity, for example at a level at which it constitutes more than 80%, more than 90%, more 
than 95% or more than 98% of the protein in the preparation. However, it may be 
10 desirable to mix the protein with other proteins in the final vaccine formulation. 

Vaccination against diseases 

The primary use of the proteins of the invention is as therapeutic or prophylactic vaccines. 
15 The invention includes a pharmaceutical composition (e.g. a vaccine composition) 

comprising a protein of the invention, a particle of the invention or a nucleic acid molecule 
of the invention and a pharmaceutically acceptable carrier or diluent. 

The principle behind prophylactic vaccination is to induce an immune response in a host 
20 so as to generate an immunological memory in the host. This means that, when the host is 
exposed to the virulent pathogen, it mounts an effective (protective) immune response, i.e. 
an immune response which inactivates and/or kills the pathogen. The invention could 
form the basis of a prophylactic vaccine against a range of diseases and conditions, such as 
HBV, HAV, HCV, influenza, foot-and-mouth disease, polio, herpes, rabies, AIDS, dengue 
25 fever, yellow fever, malaria, tuberculosis, whooping cough, typhoid, food poisening, 
diarrhoea, meningitis and gonorrhoea. The epitopes in the protein of the invention are 
chosen so as to be appropriate for the disease against which the vaccine is intended to 
provide protection. 

30 The principle behind therapeutic vaccination is to stimulate the immune system of the host 
to alleviate or eradicate a disease or condition. There are a number of diseases and 
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conditions which may be susceptible to therapeutic vaccination, such as chronic viral 
diseases including chronic HBV and chronic HCV, cancer, and allergies such as asthma, 
atopy, eczema^ rhinitis and food allergies. 

5 Chronic viral diseases arise when the imihune system of an infected host fails to eliminate 
the virus, allowing the virus to persist in the host for a long period of time. The invention 
may be used to induce the immune system of the chronically infected individual so as to 
eliminate the virus. For example, is believed that patients with chronic hepatitis have an 
inadequate T-cell response, and that stimulation of an appropriate T-cell response can 
1 0 eliminate the virus. Thus, in order to treat chronic viral hepatitis using the invention, T- 
cell epitopes may be inserted into the protein of the invention, such as T-cell epitopes from 
the pre-Sl and pre-S2 regions of HBV. 

Similarly, in the case of cancer, it is believed that enhancement of the T-cell response to 
1 5 tumour antigens may help the immune system to destroy the tumour. It is believed that 
allergic diseases are caused at least in part by an unbalanced T-cell response in which an 
inflammatory Th2 responses dominates over an antagonistic Thl response, and that 
allergies may therefore be treated by enhancing the Thl response. This can be achieved 
according to the invention by using a protein containing a heterologous epitope which 
20 stimulates a Thl response. 

Suitable carriers and diluents for inclusion in pharmaceutical compositions of the 
invention are isotonic saline solutions, for example phosphate-buffered saline. The 
composition will normally include an adjuvant, such as aluminium hydroxide. The 

25 composition may be formulated in liquid form for injection. The composition comprises 
the protein, particles or nucleic acid in a prophylactically or therapeutically effective 
amount. Typically, the protein or particles are administered in a dose of from 0.1 to 200 
fig, preferably from 1 to 100 [ig, more preferably from 10 to 50 |ig body weight. The 
nucleic acid of the invention may be administered directly as a naked nucleic acid 

30 construct using techniques known in the art or using vectors known in the art. The amount 
of nucleic acid administered is typically in the range of from 1 fig to 10 mg, preferably 
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from 100 jig to 1 mg. The vaccine may be given in a single dose schedule or a multiple 
dose schedule, for example in from 2 to 32 or from 4 to 16 doses. The routes of 
administration and doses given above are intended only as a guide, and the route and dose 
may ultimately be at the discretion of the physician. 

5 

Experimental Section 

Brief description of the drawings 

10 Figure 1: A hypothetical model showing the feasibility of a linked AB dimer of hepatitis 
B core. 

Figure 2: A schematic representation of the construction of helero- and homo-tandem 
cores. The bars represent the primary structures of the proteins. Within the assembly 
15 domain of HBcAg (amino acids 1-144), the el loop (black rectangle) and the regions 
involved in intradimer (light shading) and interdimer (dark shading) contacts are indicated. 
The Arg-rich nucleic acid binding domain is symbolised by +. Primers (fable 1) are 
indicated as arrows. 

20 Figure 3: A 12% SDS-PAGE of fractions from a sucrose density gradient separation of 
homo-tandem core particles. 

Figure 4: Electron micrograph of hetero-tandem core particles with a linker comprising 
five repeats of GGS. 

25 

Figure 5: A Western blot showing the efficient expression of hetero-tandem cores in 
Ecoli. The cores contained 5, 6 and 7 GGS repeats as the linker respectively (GGS5, 
GGS6 and GGS7). 

30 Figure 6: The results of cryo-electronmicroscopy of tandem core particles. Figure 6(a) 
shows tandem core particle (the left-hand particle) in comparison with a native particle 
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(the middle particle), the right-hand part of Figure 6(a) shows the C-terminal part of core 
antigen in a tandem core particle. Figure 6(b) shows the fitting of a portion of the 
structure of a tandem core, particle with a native particle. 

5 Methods 

Examination of the HBV core particle structure suggested that a flexible linker of at least 
1 .5nm (15 A) could be used to link the two proteins in a dimer. pair without disrupting 
their structural integrity (Figure 1). Consequently, constructs were made by overlapping 
10 PCRs in which the upstream core protein was truncated to residue 149 and then linked to a 
downstream copy via 5, 6 or 7 copies of a GlyGlySer (GGS) repeat sequence (Figure 2). 
The downstream copy was either the full length core protein or was truncated at amino 
acid 149 to remove the Arg-rich C-terminal region. Table 1 gives the oligonucleotide 
sequences used to construct the various HBV tandem cores. 

15 

The constructs were cloned into ptrc99A expression vector, transformed into Kcoli JM 
109 and induced with IPTG. Cells were then harvested by centrifugation, resuspended 
into PBS and sonicated twice. Lysates containing soluble expressed tandem cores were 
made 30% saturated ammonium sulphate and the precipitated proteins collected by 
20 centrifugation, resuspended into PBS and dialysed against phosphate-buffered saline. The 
clarified lysate was loaded onto 15-45% linear sucrose gradients and centrifuged at 28,000 
rpm for 4 hours at 4°C. Gradients were fractionated from the bottom of the tube into 2 ml 
aliquots and analysed by SDS-PAGE and Western Blotting using a monoclonal primary 
antibody against HBV core protein (mAb 13). 

25 

HBV core particle preparations were spotted onto carbon coated grids, negatively stained 
with uracyl acetate and visualized in transmission electron microscopy. The structures of 
the core particles were determined using cryo-electronmicroscopy. 

30 
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Table 1: Sequences of the oligonucleotide primers used for cloning HBV tandem core 
genes into ptrc99A. 



5 


Primers 


Sequences (5'-^3') 




1 


GTTACCATGGACATTGACCCTTAT* 




2 


GTCCATAGA(ACCACCAGA) 5 AACAACAGTAGTTTCCGG 




3 


GTCCATAGA(ACCACCAGA) ( 5AACAACAGTAGTTTCCGG 




4 


GTCCATAGA(ACCACCAGA) 7 AACAACAGTAGTTTCCGG 


10 


5 


GTTGTT(GGTGGTTCT) 5 ATGGACATTGACCCTTAT 




6 


GTTGTT(GGTGGTTCT) 6 ATGGACATTGACCCTTAT 




7 


GTTGTT(GGTGGTTCT) 7 ATGGACATTGACCCTTAT 




8 


TATGAAGCTTATGAGTCCAAGGA b 




9 


TATGAAGCTTCCGTCGTCAAACAA" 



15 



a Ncol restriction site is boldfaced 
b Hindlll restriction site is boldfaced 

Results 

20 

Tandem HBV core proteins with 5, 6 or 7 copies of GGS were all expressed successfully 
and were shown to migrate in polyacrylamide gels with the expected mobilities. Each 
assembled into eore particles as evidenced by their sedimentation in sucrose density 
gradients (Figure 3) and their appearance in the electron microscope (Figure 4). The 
25 particles retained their antigenic properties as demonstrated by their reactivity in ELISA 
and Western blots (Figure 5). Furthermore, the structures of the particles formed by the 
tandem core proteins were indistinguishable from the structure of native core particles in 
cryo-electronmicroscopy (Figure 6). 

30 
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CLAIMS 

1 . A protein comprising tandem copies of hepatitis B core antigen (HBcAg). 
5 • 

2. A protein according to claim 1 which is a dimer of two copies of HBcAg. 

3. A protein according to claim 1 or 2 wherein one or more of the copies of HBcAg 
has a heterologous epitope in the el loop. 

10 

4. A protein according to claim 3 wherein all the copies of HBcAg have a 
heterologous epitope in the el loop. 

5. A protein according to claim 4 wherein all the copies have the same heterologous 
1 5 epitope in the el loop. 

• 6. A protein according to claim 4 wherein each copy has a different heterologous 
epitope in the el loop. 

20 7. A protein according to any one of claims 3 to 6 wherein the or each heterologous; 
epitope is from the pre-Sl or pre-S2 region of hepatitis B virus (HBV). 

8. A protein according to any one of claims 3 to 7 wherein the or each heterologous 
epitope is in a heterologous sequence of from 10 to 120 amino acid residues in the 

25 el loop. 

9. A protein according to any one of the preceding claims wherein one or more of the 
copies of HBcAg is truncated at the C-terminus. 
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10. A protein according to any one of the preceding claims wherein the tandem copies 
of HBcAg are j oined by a linker. 

11. A protein according to claim 1 0 wherein the linker is at least 1 .5 nm in length. 

5 

12. A protein according to claim 10 or 1 1 wherein the linker comprises multiple copies 
of the sequence GlyGlySer (GGS). 

13 . A protein according to claim 12 wherein the linker comprises 5, 6 or 7 copies of 
10 the sequence GGS. 

14. A particle comprising multiple copies of a protein as claimed in any one of the 
preceding claims. 

15 15. A nucleic acid molecule encoding a protein as claimed in any one of claims 1 to 
13. 

16. A nucleic acid molecule according to claim 1 5 which is an expression vector. 
20 17. A host cell comprising a nucleic acid molecule as claimed in claim 15 or 16. 

18. A process for producing a protein as claimed in any one of claims 1 to 13, which 
process comprises culturing a host cell containing a nucleic acid molecule which 
encodes the protein under conditions in which the protein is expressed, and 

25 recovering the protein. 

19. A pharmaceutical composition comprising a protein as claimed in any one of 
claims 1 to 13, a particle as claimed in claim 14 or a nucleic acid molecule as 
claimed in claim 15 or 16 and a pharmaceutically acceptable carrier or diluent. 

30 
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20. A protein according to any one of claims 1 to 13, a particle according to claim 14 
or a nucleic acid molecule according to claim 15 or 16 for use in a method of 
prophylactic or therapeutic vaccination of the human or animal body. 

21 . A protein, particle or nucleic acid molecule according to claim 20 for use in a 

5 method of prophylactic or therapeutic vaccination of the human or animal body 

against HBV. 

22. Use of a protein according to any one of claims 1 to 13, a particle according to 
claim 14 or a nucleic acid molecule according to claim 15 or 16 for the 

1 0 manufacture of a medicament for prophylactic or therapeutic vaccination of the 

human or animal body against HBV. 

23 . A method of prophylactic or therapeutic vaccination of a subject, which method 
comprises administering to the subject a protein as claimed in any one of claims 1 

15 to 13, a particle as claimed in claim 14 or a nucleic acid molecule as claimed in 

claimed 15 or 16. 
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Fig.4. 
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Fig.5. 

Hetero-tandem cores 



HBc GGS5 GGS6 GGS7 




38-39 kDa 



SUBSTITUTE SHEET (RULE 26) 



A 



WO 01/77158 



PCT/GB01/01607 




WO 01/77158 



PCT/GB01/01607 



SEQUENCE LISTING 



<110> MEDEVA EUROPE LIMITED 

<120> HEPATITIS B CORE ANTIGEN FUSION PROTEINS 

<130> N79405 

<160> 2 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 639 
<212> DNA 

<213> Hepatitis B virus 

<220> • 
<221> CDS 
<222> (1)..(639) 

<400> 1 

atg caa ctt ttt cac etc tgc eta ate ate tct tgt tea tgt cct act 48 
Met Gin Leu Phe His Leu Cys Leu He He Ser Cys Ser Cys Pro Thr 
15 10 15 

gtt caa gee tec aag ctg tgc ctt ggg tgg ctt tgg ggc atg gae ate 96 
Val Gin Ala Ser Lys Leu Cys Leu Gly Trp Leu Trp Gly Met Asp He 
20 . 25 30 

gac cct tat aaa gaa ttt gga get act gtg gag tta etc teg ttt ttg 144 
Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu Ser Phe Leu 
35 . 40 45 

cct tct gac ttc ttt cct tea gta cga gat ctt eta gat ace gee tea 192 
Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp Thr Ala Ser 
50 55 60 

get ctg tat egg gaa gee tta gag tct cct gag cat tgt tea cct cac 240 
Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro His 
65 70 75 80 

cat act gca etc agg caa gca att ctt tgc tgg ggg gaa eta atg act 288 
His Thr Ala Leu Arg Gin Ala He Leu Cys Trp Gly Glu Leu Met Thr 
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85 










90 










95 






eta get ace tgg 
Leu Ala Thr Trp 
100 


gtg 

Val 


ggt 

Gly 


gtt 
Val 


aat 
Asn 


ttg 
Leu 
105 


gaa 
Glu 


gat 
Asp 


cca 
Pro 


gcg 
Ala 


tct 
Ser 
110 


aga 
Arg 


gac 
Asp 


336 


eta gta gtc agt 
Leu Val Val Ser 
115 


tat 
Tyr 


gtc 
Val 


aac 
Asn 


act 
Thr 
120 


aat 
Asn 


atg 
Met 


ggc 

Gly 


eta 
Leu 


aag 
Lys 
125 


ttc 
Phe 


agg 
Arg 


caa 
Gin 


384 


etc ttg tgg ttt 
Leu Leu Trp Phe 
130 


cac 
His 


att 
lie 


tct 
Ser 
135 


tgt 

Cys 


etc 
Leu 


act 
Thr 


ttt 
Phe 


gga 

Gly 
140 


aga 
Arg 


gaa 
Glu 


aca 
Thr 


gtt 
Val 


432 


ata gag tat ttg 
He Glu Tyr Leu 
145 


gtg 

Val 


tct 
Ser 
150 


ttc 
Phe 


gga 

Gly 


gtg 

Val 


tgg 
Trp 


att 
He 
155 


cgc 
Arg 


act 
Thr 


cct 
Pro 


cca 
Pro 


get 
Ala 
160 


480 


tat aga cca cca 
Tyr Arg Pro Pro 


aat 
Asn 
155 


gee 
Ala 


cct 
Pro 


ate 
He 


eta 
Leu 


tea 
Ser 
170 


aca 
Thr 


ctt 
Leu 


ccg 
Pro 


gag 
Glu 


act 
Thr 
175 


act 
Thr 


528 


gtt gtt aga cga 
Val Val Arg Arg 
180 


cga 
Arg 


ggc 
Gly 


agg 
Arg 


tec 
Ser 


cct 
Pro 
185 


aga 
Arg 


aga 
Arg 


aga 
Arg 


act 
Thr 


ccc 
Pro 
190 


teg 
Ser 


cct 
Pro 


576 


cgc aga cga agg 
Arg Arg Arg Arg 
195 


tct 
Ser 


caa 
Gin 


teg 
Ser 


ccg 
Pro 
200 


cgt 
Arg 


cgc 
Arg 


aga 
Arg 


aga 
Arg 


tct 
Ser 
205 


caa 
Gin 


tct 
Ser 


egg 
Arg 


624 


gaa tct caa tgt 
Glu Ser Gin Cys 
210 


tag 
























639 


<210> 2 
<211> 212 
<212> PRT 
<213> Hepatitis 


B virus 
























<400> 2 

Met Gin Leu Phe 
1 


His Leu 
5 


Cys 


Leu 


He 


lie 
10 


Ser 


Cys 


Ser 


Cys 


Pro 
15 


Thr 




Val Gin Ala Ser 


Lys Leu 


Cys 


Leu 


Gly 


Trp 


Leu 


Trp 


Gly 


Met 


Asp 


He 
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20 



25 



30 



.Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu Ser Phe Leu 
35 40 .45 

Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp Thr Al a Ser 
50 55 60 

Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro His 
65 70 75 80 

His Thr Ala Leu Arg Gin Ala He Leu Cys Trp Gly Glu Leu Met Thr 
85 90 ' 95 

Leu Ala Thr Trp Val Gly Val Asn Leu Glu Asp Pro Ala Ser Arg Asp 
100 105 110 

Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys Phe Arg Gin 
115 120 125 

Leu Leu Trp Phe His He Ser Cys Leu Thr Phe Gly Arg Glu Thr Val 
130 135 140 

He Glu Tyr Leu Val Ser Phe Gly Val Trp He Arg Thr Pro Pro Ala 
145 150 155 160 

Tyr Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro Glu Thr Thr 
165 170 175 

Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser Pro 
180 185 190 . 

Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser Gin Ser Arg 
195 . 200 205 



Glu Ser Gin Cys 
210 
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